- Oversee the operation of software and services in the Coordination team.
- Leverage deep expertise in networking, routing, and traffic patterns.
- Leverage deep expertise in service discovery, distributed services, and cloud infrastructure.
- Develop monitoring, alerting, and incident response solutions.
- Contribute to ongoing enhancement efforts and champion reliability engineering best practices.
- Highly motivated team player with initiative.
- Strong debugging, documentation, and communication skills.
- Ability to work collaboratively in a dynamic environment.
- Availability for occasional travel (up to 20%).
- Bachelor's degree or above in Computer Science, Engineering, or related field.
- 5+ to 10+ years of experience in site reliability engineering or related roles.
- Expertise in relevant technologies, such as CDN operations, containerization, incident management, traffic routing, and distributed systems.
- Proficiency in scripting and automation (Python, Perl, Go).
- Strong knowledge of Unix/Linux system administration at scale.
Company
Location
London, England - United Kingdom
Job type
Full-Time
Python Job Details
Join the dynamic Site Reliability Engineering teams across various domains and locations. As an SRE, you will play a crucial role in ensuring the high performance, reliability, and security of our systems. Each team focuses on different aspects of our infrastructure.
Please note: This is an on-site role for London, UK
Team:
As a Staff SRE Engineer on the Coordination team, you will:
Who You Are:
Qualifications:
More Developer Job Boards
Fullstack Developer Jobs Golang Jobs JavaScript Jobs Python Jobs React Jobs Rust Jobs Java Jobs